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Frantz 



Enclosed are art findings of the 103 variety, some of which you viewed on Friday. I took the liberty of 
flagging as well as highlighting some others that I felt would be of direct relevance — on top of what I 
showed you on Friday. However, you might want to look at some of the others in the event that your 
vantage point may provide you with the ability to see great pertinence in some of the others. Moreover, I 
have saved the retrieved buffers from the searches in the event that you may perhaps be interested in 
something peripheral to our approach (say the image extraction component of conversion, some of which are 
included in this packet). Do let me know if I can be of further assistance. 

Byron 
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Set Items Description 

51 4669973 INTELLECTUAL () PROPERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 

RK? 

52 20577 TRADE 0 DRESS? OR DESIGN () RIGHT? OR PROPRIETARY () INFORMATIO- 

N? 

53 2 LEGALLY 0 PROTECTABLE 0 KNOWLEDGE OR T7\NG I BLE () RESEARCH () PRO- 

PERTY 

54 1067220 DATABASE OR DATABANK OR DATA() (BASE? OR BANK? OR FILE? OR - 

REPOSITOR? OR WAREHOUSE?) OR DB OR RDB OR OODB OR ODBC OR DBMS 

55 10506358 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 

WAN? ? OR ONLINE 

56 7077174 ETHERNET? OR EXTRANET? OR WWW OR WORLD () WIDE {) WEB OR WORLD- 

WIDEWEB OR SUBNET? 

57 4963437 WEBSITE? OR WEB () SITE? OR WEBPAGE? OR WEB () PAGE? OR WEB()A- 

DDRESS? OR URL?? OR URI?? 

58 5522278 HOMEPAGE? OR HOME () PAGE? OR FRONTPAGE? OR FRONT () PAGE? OR - 

SITE? OR HTML 0 FILE? 

59 1096792 SI : S3 ( ION) S4 : S8 

510 335256 SEARCH? OR RESEARCH? OR RETRIEV? OR INQUIR? OR QUERY? OR Q- 

UERIES OR EXAMIN? OR INSPECT? 

511 217257 REQUEST? OR DATA () MINE? OR DATA () MINING? OR DATAMINE? OR D- 

ATAMINING? OR FIND? OR DISCOVER? 

512 104776 INTERROGAT? OR WEBCRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 

ETA{) CRAWL? OR SEEK? OR SORT? OR HUNT? 

513 3488 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

514 472633 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 

EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

515 296764 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 

RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

516 556334 PULL () DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 

ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

517 114549 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 

ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

518 364108 MODIF? OR REVIS??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 

HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

519 625636 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR LEADO- 

FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

520 438559 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? OR DUPLICAT? OR 

ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

521 349961 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPL? OR TERTIAR? 

OR THIRD OR 3RD 

522 19714 (S9 OR S13 ) ( 5N) SIO : S12 

523 4776 S22 AND (S14:S16 AND S17 : S18 ) ( ION) (SI : S8 ) 

524 3664 S23 AND (S17:S18 AND S19 : S21) ( ION) (SI : S8 ) 

525 1673 S24 AND S17 : S18 (ION) (SI : S3 OR S14:S16) 

526 1474 S25 AND (S14:S16 AND S17:S18) (5N) (S1:S8) 

527 888 S26 AND SI : S3 (5N) S4 : S8 

528 881 S27 AND S1(5N)S4:S8 

529 225 S27 AND S1(5N)S7:S8 

530 659 S28 AND S1(3N)S4:S8 

531 117 S29:S30 AND S17 : S18 ( ION) S19 : S2 1 AND Sl(lON) (S7:S8) 

532 39 S31 AND {S14:S16 AND S17:S18) (5N) (S19:S21 AND SI) (5N)S7:S8 

533 242 S29 OR S31:S32 

534 636696 PD>2000 

535 78 S33 NOT S34 

536 65 RD (unique items) 
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(c) 2005 Reed Business Information Ltd. 
File 370:Science 1996-1999/ Jul W3 



(c) 1999 AAAS 
File 484 : Periodical Abs Plustext 1986 -2005/Dec W2 

(c) 2005 ProQuest 
File 553:Wilson Bus. Abs. FullText 1982 -2004/Dec 

(c) 2005 The HW Wilson Co 
File 610:Business Wire 1999-2005/Dec 19 

(c) 2005 Business Wire. 
File 613: PR Newswire 1999-2005/Dec 14 

(c) 2005 PR Newswire Association Inc 
File 621:Gale Group New Prod. Annou . (R) 1985-2005/Dec 19 

(c) 2005 The Gale Group 
File 624 : McGraw-Hill Publications 1985-2005/Dec 16 

(c) 2005 McGraw-Hill Co. Inc 
File 634: San Jose Mercury Jun 1985-2005/Dec 16 

(c) 2005 San Jose Mercury News 
File 635:Business Dateline (R) 1985-2005/Dec 19 

(c) 2005 ProQuest Inf o&Learning 
File 636:Gale Group Newsletter DB(TM) 1987-2005/Dec 19 

(c) 2 005 The Gale Group 
File 647: CMP Computer Fulltext 1988 -2005/Dec W2 

(c) 2005 CMP Media, LLC 
File 674 :Computer News Fulltext 1989-2005/Oct W2 

(c) 2005 IDG Communications 
File 696:DIALOG Telecom. Newsletters 1995-2005/Dec 16 

(c) 2005 Dialog 
File 810:Business Wire 1986-1999/Feb 28 
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File 813 :PR Newswire 1987-1999/Apr 30 

(c) 1999 PR Newswire Association Inc 



36/3, K/20 (Item 11 from file: 484) 

DIALOG (R) File 4 84 : Periodical Abs Plustext 
(c) 2005 ProQuest. All rts. reserv. 

04031281 (USE FORMAT 7 OR 9 FOR FULLTEXT) 
Searching for images by similarity online 

Jacso, Peter 

Online (ONL) , v22 n6 , p99-104, p. 5 
Nov 1998 

ISSN: 0146-5422 JOURNAL CODE: ONL 

DOCUMENT TYPE; Feature 

LANGUAGE: English RECORD TYPE: Fulltext; Abstract 

WORD COUNT: 2188 

TEXT: 

... as well . 

This article is an overview of how the image retrieval software from 
these three companies works, based on demo image collections on their 
Web sites . Their software is usually licensed to third -party vendors 
for specific applications, unlike the software employed by the major search 
engines . But ... 

...image search engines, but it may have to do with the size of the image 
collection at its demo site . 

The most telling test searches were related to national flags (Figure 
1) . These are very. . .as the San Francisco Museum of Modern Art or the 
French Ministry of Culture, that want to offer searchable art 
collections by similarity measures. 

Although only three attributes (color layout, color percentage, and 
texture) can be used as search criteria for most... 

...coarseness, contrast, and presence or absence of directionality of lines 
in images. 

IBM's demo site contains the two collections that demonstrate best 
the power of the software in finding similar images. One is the... 

...sound like high-tech software gadgets for artsy folks, but a look at 
IBM's trademark image collection on its demo site reveals how 
important similarity searching in business (and science) applications can 
be. Developing, introducing, and protecting a distinctive sign or... 

...the Coca Cola trademark, or the shape of the Continental and Nike logos. 

Among the first steps in developing a trademark is to find if 
there are similar logos or designs. This can be a costly... 

...40 per hour, but it is available only in the Virginia office. The CD-ROM 
trademark collection is rather expensive and excels in smart text 
searching. Consulting the trademark archive on microfiche... 

...the Web free of charge. It is possible that IBM will do it again with 
trademarks . This is important because while there are good patent 
databases online with sophisticated search facilities, trademark 
searching depends on structure and pattern recognition that none of the 
currently existing trademark databases offer. . . 

...is far more simple and efficient to search by design pattern. Although 
IBM's sample collection is very small (1,000 trademarks ), the test 



searches proved the excellent quality of the QBIC software as customized 
for trademark. . . 

...selected as the base figure. By clicking on the base image, its shape 
component is extracted and displayed to the left of the original 
trademark , and the software goes out to find images with similar shape 
components. Figure 4 shows .mostly done automatically, indexing of images 
will follow this model, especially for images stored on Web pages . 
Image collections are likely to use both human indexing and 
classification and automatic extraction of visual attributes to 
facilitate finding highly relevant images swiftly. As this happens, image 
collections . . . 



36/3, K/53 (Item 4 from file: 636) 

DIALOG (R) File 636:Gale Group Newsletter DB{TM) 
(c) 2005 The Gale Group. All rts . reserv. 

03684468 Supplier Number: 47948076 (USE FORMAT 7 FOR FULLTEXT) 
MAPIT: NEW PATENT SEARCH SERVICE TOOL 
Online Newsletter, vl8; n9, pN/A 
Sept 1, 1997 

Language: English Record Type: Fulltext 
Document Type: Newsletter; Trade 
Word Count : 32 8 

(USE FORMAT 7 FOR FULLTEXT) 
TEXT: 

. . .developed by Manning & Napier Information Services (MNIS) is now being 
demonstrated via the Internet's World Wide Web ( WWW ). MAPIT, which 
the company describes as its " patent data mining tool" compares the 
contents of thousands of patents and automatically produces visualizations 
of the research results. The MNIS technology behind MAPIT was first 
developed for government intelligence agencies, and later adapted for use 
in major corporations and patent offices around the world, according to 
MNIS. The service is offered as either an outsourced product or for 
installation on a corporate network . MAPIT applies advanced technology to 
patent research by which large patent portfolios can be analyzed 
identify similarities, differences, and relationships. The research results 
can be visually. . . 

...a detailed analysis of possible infringement. The online MAPIT 
demonstration may be viewed at: http:// www .mnis. net /MAPITdemo or 
through a patent research service offered on the Internet WWW by 
IBM by following the "Resource Page" link at IBM's website : http:// 
patent .womplex.ibm.com/respage.html The IBM website is an electronic 
"front door" to a database of patents issued in the U.S. during the 
last 27 years, plus 17 years of patent images. The IBM site contains 
over 2 million patents . In addition to its sophisticated, natural 
language processing technology, MAPIT incorporates the same DB2 -based 
technology that IBM uses on its site to store over 100GB of patent 
data. MAPIT will be available as a value-added option to companies which 
purchase their. . . 



36/3, K/57 (Item 8 from file: 636) 

DIALOG (R) File 636:Gale Group Newsletter DB (TM) 
(c) 2005 The Gale Group. All rts . reserv. 

02248919 Supplier Number: 44298267 (USE FORMAT 7 FOR FULLTEXT) 
IMAGES OF HISTORY CAPTURED BY MODERN TECHNOLOGY 

Document Imaging Report, v3 , n25, pN/A 
Dec 15, 1993 

Language: English Record Type: Fulltext 
Document Type: Newsletter; Trade 
Word Count: 998 

ownership documents that date back as far as 1788. Federal land 
records --field notes, survey plats, patent records, tract books and case 
files--for the states bordering and east of the Mississippi... 

. . .million, six-year project is in its second year and is expected to scan 
and store more than seven million of BLM's Government Land Office (GLO) 
records. So far, GLO has processed more than 1.6 million documents. 
Expense . . . 

. . .age and size of the documents, particular care is exercised in the 
scanning process. The land patents , as they are called, are on 
parchment paper and measure approximately 14 X 20 inches... 
...Each scanner is capable of processing more than 1,250 images per day 
with each patent taking about ten seconds . 

After capture, the image is sent via a Novell network to an indexer, 
who extracts 37 items of information from the actual image. Data is 
indexed into fields and entered. . . 

...do any image twice," says Lamar McCown, DCI ' s project manager for the 
digital record conversion process. 

BLM researchers are now able to electronically access land patents 
from Florida, Louisiana, Arkansas, Ohio, Michigan, Wisconsin and Minnesota 
via NEC PowerMate 486SX/25e PC... 

...so records can be located in less than a minute. 

Retrieval requires less than 15 seconds . Access to the database 
costs $2.00 per query session minute and printed images cost $0.13 per 
page . 

On- site access improved, but off- site image retrieval even 
better 

Most patent requests come from title attorneys and abstract 
companies that before they insure a parcel of land. . . 

...the BLM for the patents. 

Rather than have investigators travel to Virginia to access a patent 
, the BLM is implementing a off- site retrieval and fax service so users 
can query their searches by modem. "The system will free... 
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Set Items Description 

51 3235538 INTELLECTUAL {) PROPERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 

RK? 

52 25677 TRADE ODRESS? OR DESIGN () RIGHT? OR PROPRIETARY () INFORMATIO- 

N? 

53 2 LEGALLY 0 PROTECTABLE 0 KNOWLEDGE OR TANGIBLE {) RESEARCH {) PRO- 

PERTY 

54 1517121 DATABASE OR DATABANK OR DATA { ) (BASE? OR BANK? OR FILE? OR - 

REPOSITOR? OR WAREHOUSE?) OR DB OR RDB OR OODB OR ODBC OR DBMS 

55 12447445 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 

WAN? ? OR ONLINE 

56 6270238 ETHERNET? OR EXTRANET? OR WWW OR WORLD () WIDE () WEB OR WORLD- 

WIDEWEB OR SUBNET? 

57 3696931 WEBSITE? OR WEB () SITE? OR WEBPAGE? OR WEB () PAGE? OR WEB()A- 

DDRESS? OR URL?? OR URI?? 

58 5077209 HOMEPAGE? OR HOME {) PAGE? OR FRONTPAGE? OR FRONT {) PAGE? OR - 

SITE? OR HTML 0 FILE? 

59 870949 SI : S3 { ION) S4 : S8 

510 261432 SEARCH? OR RESEARCH? OR RETRIEV? OR INQUIR? OR QUERY? OR Q- 

UERIES OR EXAMIN? OR INSPECT? 

511 185008 REQUEST? OR DATA () MINE? OR DATA () MINING? OR DATANINE? OR D- 

ATAMINING? OR FIND? OR DISCOVER? 

512 101224 INTERROGAT? OR WEBCRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 

ETAO CRAWL? OR SEEK? OR SORT? OR HUNT? 

513 4310 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

814 378483 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 
EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

515 237999 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 

RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

516 434143 PULL 0 DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 

ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

517 95843 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 

ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

518 306561 MODIF? OR REVIS??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 

HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

519 526506 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR LEADO- 

FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

520 375558 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? OR DUPLICAT? OR 

ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

521 296584 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPL? OR TERTIAR? 

OR THIRD OR 3RD 

522 19272 (S9 OR S13 ) ( 5N) SIO : S12 

523 5534 S22 AND {S14:S16 AND S17 : S18 ) ( ION) (SI : S8 ) 

524 4556 S23 AND (S17:S18 AND S19 : S21) ( ION) (SI : S8 ) 

525 2354 S24 AND S17 : S18 ( ION) (SI : S3 OR S14:S16) 

526 2154 S25 AND (S14:S16 AND S17:S18) (5N)S1:S8 

527 1550 S26 AND SI : S3 ( 5N) S4 : S8 

528 538 S27 AND S1(5N)S7:S8 

529 250 S28 AND S17:S18(10N) S19:S21 AND S1(10N)S7:S8 

530 96 S29 AND (S14:S16 AND S17:S18) (5N) (S19:S21 AND SI) (5N)S7:S8 

532 615440 PD>2000 

533 22 S30 NOT S32 

534 60 S29 NOT S32 

535 60 S34 NOT S32 

536 60 S33 OR S35 

537 44 RD (unique items) 

File 9:Business & Industry(R) Jul/1994 -2005/Dec 16 

(c) 2 005 The Gale Group 
File 13:BAMP 2005/Dec W2 

(c) 2 005 The Gale Group 
File 15 :ABl/lnform(R) 1971-2005/Dec 16 



(c) 2005 ProQuest Inf o&Learning 
File 16:Gale Group PROMT (R) 1990-2005/Dec 16 

(c) 2 005 The Gale Group 
File 47:Gale Group Magazine DB(TM) 1959-2005/Dec 16 

(c) 2005 The Gale group 
File 75:TGG Management Contents (R) 86-2005/Dec W2 

(c) 2 005 The Gale Group 
File 88:Gale Group Business A.R.T.S. 1976-2005/Dec 16 

(c) 2005 The Gale Group 
File 98:General Sci Abs/Full -Text 1984-2004/Dec 

(c) 2005 The HW Wilson Co. 
File 141:Readers Guide 1983 -2004/Dec 

(c) 2005 The HW Wilson Co 
File 148:Gale Group Trade & Industry DB 1976-2005/Dec 16 

{c)2005 The Gale Group 
File 160:Gale Group PROMT (R) 1972-1989 

(c) 1999 The Gale Group 



21/2,^/^ (Item 3 from file: 13) 

DIALOG (R) File 13 :BAMP 

(c) 2005 The Gale Group. All rts. reserv. 

00575653 Supplier Number: 24187255 (USE FORMAT 7 OR 9 FOR FULLTEXT) 

Search Engine Designer for Tomorrow: Interview with TextWise's Elizabeth 
Liddy 

(A description of the capabilities offered by the database company TextWise 

and its DR-Link search system) . 
Article Author (s) : Quint, Barbara 
Searcher, v 6, n 3, p 19-22 
March 1998 

DOCUMENT TYPE: Journal; Interview & speech ISSN: 1070-4795 (United States 
) 

LANGUAGE: English RECORD TYPE: Fulltext; Abstract 
WORD COUNT: 2354 

(USE FORMAT 7 OR 9 FOR FULLTEXT) 

ABSTRACT : 

. . .available on the desktops of corporate workers and institutional 
researchers to assist in the retrieval, extraction , and analysis of 
information. A Ph.D from the School of Information Studies at the... 

...she was instrumental in the development of DR-Link, the search engine 
for the MNIS online information service which was originally developed 
for military intelligence analysts. Liddy considers information analysis 
and knowledge extraction as the core business of TextWise. The company is 
developing CHronological Information Extraction System (CHESS) , a 
business tool for the information specialist involved in competitive 
intelligence. Article includes... 

TEXT: 

...Editor, Searcher Magazine 

Where will tomorrow's Roger Summit come from? Who is designing the 
database management systems that will power information systems in the 
Third Millennium? We decided to talk to Elizabeth (Liz) Liddy of TextWise 
and Syracuse University, creator. . . 
...The key point is analysis. 

BQ: What direction do you think advanced search engines will take ? 

Liddy: Two ways -- either on intranets , the next big market, or as 
improved ways to access open sources, like traditional newspaper. . . 

...across potentially relevant documents. They need to see the themes, the 
relationships between documents. They want to hyperlink straight to the 
answer and get it "pre-read" for them. 

For example, one of the applications we're doing for... 

...of the content of documents. Term matching is still the same. They still 
haven't extracted all the meaning intended in the document. For true 
natural language processing, you have to. . . 



. the documents . 



At this point, we think it's too hard for some oldsters to change . At 
TextWise, we don't want to just sell an engine to librarians, we want 
to work with them as customers, look at problems, take a consulting role, 
and design products and services with full capability and precision. There 
are . . . 

. . .ve had some interesting discussions. We see our core business as 
information analysis and knowledge extraction , as building systems to 
track and visualize all the connections. When we build systems that... 

...so important. The other key thing is data mining, knowledge discovery, 
not just finding and extracting . 

We develop our technologies for the government and then commercialize them. 
Data mining for companies . . . 

...manual insertion of data. We can process 100,000 gigabytes of text and 
create a database automatically. We can extract data from text and move 
it into a different format, then let the data mining... 

...1998, we are going to offer a new system for sale called CHESS 
(CHronological Information Extraction System). It's meant for the 
information specialist in a large agency doing competitive intelligence... 

...searchable quickly. Natural language processing has proven itself as 
offering great assistance to information retrieval, extraction , and 
analysis. It really works. It works fast. It works well. Adopt it. 

Once we . . , 

...Napier Information Systems (MNIS) , TextWise's commercial partner. 
Commercial products based on DR-LINK, the original TextWise information 
retrieval system, are now available through MNIS (http:// www . mnis.net), 
for example. TextWise currently has new products under development for 
information extraction , multilanguage retrieval, intelligent agents for 
finding information, and datamining of very large databases. Visualizers 
now. . . 

...the intellectual property assets of a potential acquisition. Use it in 
conjunction with the IBM patent search site Resource Page (http:// 
patent , wcomplex . ibm . com/respage . html) by choosing MAPIT. 

Custom hot sites : Organizations such as SCIP and AFCEA have established 
standing queries for their members on hot... 

...subtopics and find source documents for each subtopic. To see a 
demonstration, check out http:// www .afcea.org/drlink. 
Current 1998 Projects 

CHESS: an information extraction system which extracts information from 
an entire database and organizes it to track people, companies, 
countries, or organizations over time. CHESS also provides... 

...and preferences, resulting in improved precision. The system evolves 
automatically to match the user's changing interests. 



KNOW- IT: an integrated knowledge extraction system which takes raw 
text as input and outputs a structured knowledge base in any domain. The 
system includes navigation and collaboration tools for manual refinement of 
the extracted knowledge base. This data mining tool makes any recorded 
knowledge amenable to automatic conversion into. . . 
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37/3, K/7 (Item 6 from file: 13) 

DIALOG (R) File 13 : BAMP 

(c) 2005 The Gale Group. All rts. reserv. 

00501562 Supplier Number: 23598972 (USE FORMAT 7 OR 9 FOR FULLTEXT) 

A New Patent Search Tool for the Internet : QPAT-US 

(QPAT-US is Questel -Orbit ' s new database of full-text US patents from 
1974 -present ; review of the product , along with practical tips on how to 
use it, are presented) 
Article Author (s) : Lambert, Nancy- 
Database, V 19, n 4, p 56-58,60+ 
August 1996 

DOCUMENT TYPE: Journal ISSN: 0162-4105 (United States) 
LANGUAGE: English RECORD TYPE: Fulltext; Abstract 
WORD COUNT: 3047 

(USE FORMAT 7 OR 9 FOR FULLTEXT) 
A New Patent Search Tool for the Internet : QPAT-US 

(QPAT-US is Questel -Orbit ' s new database of full-text US patents from 
1974 -present ; review of the product, along with practical tips on how to 
use . . . 

) 

ABSTRACT : 

QPAT-US is the new patent search tool for the Internet released by 
Questel -Orbit . QPAT-US, containing patent information from 1974 to the 
present, can be used by both the inexperienced and experienced patent 
searchers. While it directs the inexperienced searcher to the Main Search 
page for patent searches, it allows the experienced searchers to utilize 
fairly advanced search and display capabilities. Equipped... 

TEXT: 

by Nancy Lambert 
Information Analyst 

Chevron Research and Technology Co. 
photo omitted 

Patent searching on the Internet is a hot topic in the information 
world these days, even though the resources to... 

Internet patent search resources took a big step forward with the 
introduction of QPAT-US, Questel-Orbit ' s new database of full-text U.S. 
patents , 1974 to present. While this 110GB database is not free, 
subscriptions are priced competitively: unlimited... 

...Of interest to users of the PTO Internet database: Questel-Orbit has 
mounted a free database of front - page patent information with all 
the QPAT-US search features. And, Questel-Orbit has offered QPAT-US... 

. . .and happily browse through patent titles that the search produced, 50 
at a time. Experienced patent searchers can take advantage of some 
fairly sophisticated search and display capabilities. 



RELEVANCE RANKING 



All the searches you, . . 

...appears in a patent and weighs this against how common the term is in 
the database and the length of the patent compared to others in the 
database . If you're searching multiple terms, how many of them show up 
in a patent also affects its relevance... 

. . .other things being equal (including the frequency of the terms from both 
groups in the database ) , patents with 99 percent of term occurrences 
from one group and 1 percent from the other. . . 

...equally with patents with 50 percent of term occurrences from each 
group. Clearly, I would want to see the latter first. Questel-Orbit may 
want to adjust the factors that go into relevance ranking to take 
Boolean logic into consideration. 

photo omitted 

* Whenever I talk about free-text patent searching with. . . 

...searching is not currently useful, though, since the U.S. classes are 
those on the patents when they first issued. (See the "Wish List" 
section at the end of this article for more on. . . 

...in either direction of another group of search terms) --an important 
feature for full -text searching that search engines for other Internet 
patent databases don't offer. You can search multiword phrases by putting 
them in single quotes. . . 

...the search term box and adding or deleting parentheses, terms, and 
operators . 

You can also get online help in developing your search statement. The 
system automatically does what Questel-Orbit calls "search. . . 

...seriously skew the relevance ranking, since this is based partly on the 
frequency of a search term's occurrence in the database . If a 
particular misspelling occurs only once, the patent in which it appears 
will be thrown to the top of the relevance list. I... 

...search for "hypodermic," you'll find "syringe," "ampule," "cannula," 
and "needle" as statistically related. 

The third vocabulary aid, " Dateibase Dictionary, " you use if your search 
term produced no hits. It shows you a list... 

...click on an answer set, you go to the View Set page and see an initial 
display of patent numbers and titles for the first 50 answers in 
relevance -ranked order. You may ask for the next 50, and so... 

. . .You may click on "expand view" to add two lines of abstract text to the 
first 50 titles. You may look through the titles and select patents of 
interest to order full copies, to put into a special "set 0" for further... 

. . .on the View Set page, you may do this same sort of process on the first 
five patents by clicking on "KWIC" (Key Word in Context) . At any point 
you can use Windows . . . 

...marked "similar" to see a menu of what QPAT-US calls "canned searches." 



These automatically extract data from the patent for use in new 
searches. You can search on the entire text (minus stop words) of the 
abstract or the main claim as a "natural language query" to get a new 
set of relevance -ranked patents . (I'm not sure just how useful this is, 
given how many totally irrelevant words are bound to show up in the 
abstract or main claim.) You can search on the patent 's 
classifications, either the primary class or all classes, to find other 
patents in the same classes (but don't do this yet- -see the first "Wish 
List. . . 

...variety of citation searches: forward (more recent patents that have 
cited this one) , backward (older patents cited in this one) , or " network 
" (a backward citation search followed by a forward citation search done 
on all the patents found in the backward. . . 

...how important up-to-date classes are for U.S. patent subject searching, 
since older patents are reclassified with new classes when these change 
(3) . Even if patent classifications are not the main point of this 
text-oriented database , they can be very useful both by themselves, if a 
good subclass exists for the... 

...page drawings and chemical structures. HTML provides for this 
possibility. 

One of the joys of patent searching online is being able to go quickly 
and easily between files, taking patent references from one database 
into another for further search refinement or viewing. QPAT-US is 
isolated now, with no easy links into Questel's... 

. , .would like to be able to start a search in, say, the merged API-World 
Patents Index database and then cross U.S. patents into QPAT-US for 
some full-text search refinements (or vice versa) . I also want to be able 
to move U.S. patents that I found from searching the indexed databases 
into QPAT-US to view them using its text-scanning capabilities. Ideally, 
the interface would be completely transparent from both the databases and 
the Internet side, 
photo omitted 

When I'm searching QPAT-US by patent number, either directly or by 
transferring patents from other databases, I want to provide keywords 
or character strings that the software will use to create "found terms" at 
the top and highlighted text throughout each patent . 

I would like to see some changes in "fuzzy logic." I very much want to 
see postings listed beside the terms, to help me choose between 
one-time -only . . . 

...would give it a definite edge, making it one-stop shopping for much 
U.S. patent searching . 

My major wish is based more on Internet than QPAT capabilities. I wish 
everything would go faster! Several operations that I tried had... 

...customers a lot for that work time. This doesn't allow for a search that 
takes an hour on the Internet , which I could have done in ten minutes 
on a commercial database. 



Paul Albert of. 



...who led the design team for QPAT-US, wishes that customers would hurry 
up and get more powerful computers and Internet browsers, so that he 
could write operations to take advantage of their graphical interfaces. 
The most up-to-date systems have a lot of... 

...to manipulate information and find and display data. 

CONCLUSION 

I am still not an enthusiastic Internet -for- patent - searching 
advocate, unlike some others who are writing in the field (4,5). Nothing 
that I . . . 

...to give them access to these indexed databases. QPAT-US is essentially a 
free-text patent database , but it does take free-text searching and 
scanning to higher levels than anything else out there, either on. . . 
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. . .TEXT: 14 sectors that find patents to be an important information source 
is correlated with the patent propensity rate in each sector. The second 

part of the study is based on a combined survey and interview study of 
Dutch SAIEs in five high-technology sectors. The results show that SMIlEs 
mainly use patent databases to acquire information, often for legal 
purposes, that is not available from any other source. In contrast... 

...and expertise. This points to the need for simpler and more efficient 
methods of searching patent databases . 

Introduction 

In order to receive a patent in Europe, the US and many other 
countries, an applicant must disclose enough information about... 
...legal purposes connected to the management of intellectual property. For 
example, a firm can search patent applications to determine if a planned 
innovation project might infringe another firm's patents , to check if 
other firms are infringing its own patents, to challenge a competitor's... 

...applications and granted patents, while the US Patent and Trademark 
Office (PTO) contained several million patent records. IBM has 
established a free Internet site that contains the front page , 
abstract and other information for two million PTO patents issued since 
1971. Firms can use several keywords to search the IBM site , identify 
patents of interest and then immediately order a copy of the complete 
patent for a small... 

...here as firms with fewer than 500 employees, do not use patent data. A 
recent EPO survey of a sample of research and development (R&D) 
performing firms reports that only 14% of firms with fewer than. . . 

...used patent databases. 7 This survey, unlike the others, also asked the 
respondents about four main reasons for consulting patent databases: to 
develop new products (or processes) , to follow the activities of 
competitors, to prevent the duplication of R&D and to patent a new 
invention. Approximately 3 8% of the respondents that searched patent 
databases used the information. . . 

. . . asked why SMEs do not use patent databases , 

The failure of many SMEs to use patent databases is taken up by the 
recent Green Paper on innovation by the European Commission, 8 which 
comments ... to innovative firms, because firms that do not innovate are 
considerably less likely to find patent data of use. The first method 
consists of an analysis of the relevant results of the 1993 Community 
Innovation Survey. . . 



...to patent information of value for technical, competitive or legal 
reasons. The CIS did not obtain data on whether or not each firm applied 
for a patent , but ordinal data on the importance of patents as a method 
of profiting from innovation. . . 

...sources and for the relationship between firm size and the percentage of 
firms that use patent data. The main advantage of the EPO compared to 
the CIS survey is that the EPO study contains .patent disclosures as 'very 
important' or 'crucial' sources of information with estimates of the 
sectoral patent propensity rates. The latter are obtained from the PACE 
survey of Europe's largest R&D performing firms because the equivalent... 
...increases the probability that the firm will give a higher rating to the 
value of patent databases as an information source. 

The first three variables define fundamental characteristics of the 
firm. They consist of the log of the number. . . 

...a higher percentage of R&D performing firms than non-R&D performing 
firms use patent databases. 

The second set of variables measure the specific strategic environment of 
the firm. The proportion of sales ... intensity , and presumably exposed to 
greater competition, could be more likely to benefit from using patent 
databases to gather information about what their competitors are doing. 

The second and third logit models include the... 

...they encompass a range of new technologies that could be at the 
forefront of any changes in the importance of patenting . Two of these 
sectors, IT and instruments, are similar to the computing and instruments 
sectors ... 33 . 3% among the target population of 'possible' innovators. 

Detailed information on the use of patent databases was obtained 
through semistructured interviews with firms from each of the five sectors 
of interest that had. . . 

...applicant IT firms. 

These results are consistent with and support the CIS analyses and the 
findings of the EPO study. Patent applicant firms {or firms that 
probably apply for patents because they find patents... 

...information source or to search patent databases. This indicates that 
many firms do not search patent databases to obtain technical 
information. If they did, we would expect little difference in the use rate 
between. . . 

...the firm's patent rights; for instance, to ensure that the firm is not 
infringing another firm's patent or to obtain the necessary 
information on the prior art to draw up its own patent applications. The... 

...explored more closely in the interviews. 

Interview Results 

The interview results show that SMEs use patent databases for three 
basic reasons. The most important is for legal matters, as suspected given 



the higher use. 



...to obtain technical information for use in innovation. For example, each 
firm usually had a main reason and one or more secondary reasons for 
using patent databases. Obtaining technical information was the 
primary reason for searching patent databases for only tAo of 24 
interviewed firms that used patent databases, compared to 14... 

...on their competitors. However, slightly more than half of the 22 firms 
that did not obtain technical information from patent databases as a 
primary goal stated that this was a secondary reason for their use. 

The interviews showed that there is a simple explanation for why. . . 

. . .primarily use them to obtain information that is not available from any 
other source. 

The main cost-related obstacle cited by the respondents to using patent 
databases was the time and expertise required to conduct an adequate 
search, while access costs... 

...not search them in-house but employ external consultancy services or 
innovation centres to conduct patent searches for them. The other firms 
search patent databases themselves, three via on-line services, but 
several of these also use external consultants. 

(Graph Omitted) 

Captioned as: Figure 4 

Another reason why firms might not use patent databases to acquire 
technical information is the delay, in most countries, of 18 months between 
the submission of . . . 

...reason, the interviewed respondents were asked if they thought that the 
technical information in a patent database was up to date and of value. 
With one exception, most of the respondents f rom. .. information disclosed in 
patents was no longer useful because of the rapid rate of technical change 

in their field. Their primary reason for patenting in the first 
place was to be able to trade technologies through cross-licensing, rather 
than an interest... 

...this source of information was often based on two rational assessments 
of their usefulness. 

The first reason is linked to the patenting strategy of firms. Firms 
are less likely to apply for a patent if the cost... 

...potential or actual technology 'trading' rather than as a means of 
earning licence revenue. 

The second reason why SMEs seldom use patent databases to acquire 
technical information is because of the high costs of searching patent 
databases compared to much easier and cheaper alternative sources of 
technical information. Consequently, SMEs primarily use patent databases 
to acquire information that is not available from any other source; for 
example, to search the prior... 



...would invest in costly research to 'reinvent the wheel'. Unfortunately, 
current databases such as the Patent Register of the EPO are relatively 
simple to search to obtain information of value for legal purposes, but 
of no use as a source of technical... 

...for technical information. Several respondents commented that less 
expensive and time-consuming methods of accessing patent databases for 
example, via the Internet -would probably increase their use of these 
databases. This ...to reduce the time required to find something useful. 
Part of the problem is that patent databases must contain a substantial 
amount of similar and duplicative information. Software that could limit 
this repetition could be of great value. It may also be worth preparing and 
disseminating more information on how to use the Internet to access 
patent databases. A few of the interviewed managers, for example, did not 
know that several patent . . . 
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... quite happened yet. 

* The United States Patent and Trademark Office (USPTO) is mounting a 
free database of full-text U.S. patents , 1976-date, starting in 
November . 

* The European Patent Office (EPO) is mounting a free database of 
its member countries' patents , starting in October. 

* The IBM Patent Server will cease to exist on November 1, to... 

...in effect, writing about the future. Watch this space for more 
information as it develops. 

Online Patent Databases 

But first , what's new in online patent databases. 

IFI/Plenum Data Company: IFI/Plenum Data's big news is that the Dutch 

...in research-level scientific publishing. Harry All-cock, Vice President 
of IFI/Plenum Data, assures patent searchers that the change in 
ownership won't affect IFI databases and services. 
IFI has added U.S. patent... 

...dates to the CLAIMS databases. Searchers requested this because of 
complications brought about by the change in U.S. patent law on June 8, 
1995. Patents filed after that date are good for 20 years from date of 
first U.S. filing; but patents in force at that time are good for 20 
years from date of filing or. . . 

...patents. Part of their excuse for these omissions has to do with Project 
Phoenix, their change in internal processing designed to speed up the 
passage of patents through abstracting and indexing and to get them 
online faster. Apparently, adding equivalent abstracts and patent 
citations took too long. In practice, Derwent probably wants to force 
customers to search patent... 

...and have added European and U.S. so far. They are indeed making 
progress. British patents now take just 15 days from arrival at Derwent 
to get full records with complete indexing online . U.S. patents take 

51 days as of Derwent week 9,832 {32nd week of 1998), down from 84... and 
examine full text on QPAT-US as usual, and then link to the IBM site to 
look at the patent images. One benefit: he IBM site does not see who is 
asking for the patents ; it sees only that the request comes from 
QPAT-US. This should help solve some security issues with the IBM site. 
(See the discussion later in this article on the new, improved IBM site . ) 

INPI, the French Patent Office, has taken over management of all 
Markush chemical structure files, merging MPHARM (the pharmaceutical file 
that INPI . . . 



..,STN/CAS: STN has introduced some new systems commands that will help 
customers who do patent multi-file searches and patent statistics. 
These are : 

1. "TRANSFER": Extracts and maps up to 50,000 pieces of data, e.g., 
patent numbers, from file... 

...perform these two functions.) 

3. "TABULATE": Creates, among other things, two-dimensional tables of 
data extracted from a set of patents , for instance patents by company 
by year for a technology of interest. My STN contact told me that... 
...extended family members (complex priority relationships). More details 
later, when CAS produces documentation for the online file. 

3, What percent of patent records will, in fact, include family 
information? 

Answer: Coverage is nearly complete from the early... free Internet 
databases of full-text U.S. patents, 1976 to date, and U.S. trademarks . 

The full-text patent database will use BRS Search software from 
Dataware, rather than the CNIDR software used on the current AIDS and U.S. 
patent frontpage (PatBib) databases. However, it will look like and have 
many of the same search features . . . 

. . .only. 

The AIDS database gives you the ability, via icons at the top of 
every patent front page , to search for similar patents : more in 
the same class, more by the same inventor or assignee, more "like this... 

...won't exist in the full-text database. 

Both the current databases provide date range searching . The 
full-text database will definitely allow date comparisons (all patents 
before/after a given date) ; it might or might not allow ranging between 
dates . 

The initial release of the full- text file will not have current U.S. 
patent classifications, but the PTO will add these later. 

The initial release will definitely not include proximity 
searching; that is, looking for one term within so. . . 

. . .partly because the PTO has made some concessions so as not to compete 
with commercial patent database producers and online hosts. Also, 
while the search engine does support proximity searching, the capability 
requires more storage and processing resources than the... 

...Information Dissemination (wes.gewehr@uspto.gov). 

The PTO will mount the text portion of the patent database in 
November and add images next March. Low-resolution images will be free; 
high-resolution. . . 

...the AIDS database will go away; but the PTO will put one or more canned 
searches on the full -text database to produce subsets of AIDS patents . 

Elsewhere in the PTO: Keep aware of some problems with U.S. 
provisional applications, available... 

. . .provisional filing to a normal filing within the one-year time allowed. 
The U.S. patent does not include the provisional filing information on 
its front page , nor does the PTO include it in the tapes that it sends 
to patent database producers, so these family members are often not 
connected up. The PTO is aware of the problem, as are the database 



producers; a solution should be available soon. 
The European Patent Office 

Last year, the EPO announced changes in their data distribution and 
pricing policies. As part of a plan to maximize access... 

. . .country. Individual countries may choose to add additional coverage at 
their own expense. The EPO originally named this project DIPS 

(Distributed Internet Patent Information Services) , but recently 
changed the name to EspOcenet. The sites have very limited search 
capabilities, but they do let searchers link to images of the... 

. . .the EPO Web site and investigated the databases, I found some very 
interesting inconsistencies. 

To get to the patent databases, you click on the Esp®cenet logo 
at the bottom center of the EPO home page . You immediately see two 
options: "Access via the European Patent Office" and "Access via the 
national offices of the member states." When you access via the European 
Patent Office, you then have four choices: Search in EPO , PCT, 
worldwide, or Japanese patents. When you access via the national offices of 
the member... in other cases, nothing. Most interesting, though: In many 
cases when I clicked on a patent , what I actually saw was an equivalent 
from another country, usually EPO or U.S. 

I'll be interested in hearing what other searchers observe as they 
explore this international patent search site . 

Incidentally, the EPO home page has well -designed mechanisms for 
learning about, and getting around, the site . If you click on the 
navigational icon (the spinning EPO logo in the center of the page) , the 
system will guide you. Give it a try! 

The EPO site also has one of the best compilations of patent 
information resources on the Net . It links you to the patent offices of 
EPO member states and other countries, to patent information providers' 
and online hosts' home pages , to Internet patent databases, to 
patent -related mailing lists and news groups (including PIUG, the Patent 
Information Users Group), and lots more. How easily you can get to this 
resource from the EPO home page seems to vary over time. As of 
mid-October, you can click on "contact links... 

...right of the EPO home page. But you can also access it directly at 
http:// www .european- patent -office.org/ online /index. htm. 
Other International Patent Databases on the Internet 
Independent of the EPO, other countries are starting to put varying 
amounts of patent information free on the Internet . The Canadian 
Patent Office has mounted patents from 1989 to date, searchable only 
by title text. The French Patent Office has mounted patents from 1996 to 
date. . . 

...These data-bases won't replace DWPI or Inpadoc any time soon. 
The New IBM Patent Sites 

By the time you read this, the IBM Patent Server will be gone, 
replaced by two new patent information sites . (That phoenix analogy 
keeps coming up.) These are the IP Network (free) and the IP Network for 
Business (fee) . The free site will have all the features of the old 
Patent Server plus some nice new features, including 20 new patent fields 
to search, date ranging, and the ability to page through search results. 
IBM is also adding European patents to the free site , EPO-A from 1978 
to date and EPO-B ESPACE-A and ESPACE-B CD-ROM products. And, IBM is adding 
PCT patents : bibliographic information, abstracts, and front - page 



images from 1997 to date, and full document images from 1998 to date. 
The Business . . . 

. . .try them out . 
Conclusion 

I've written a lot in the last few years about patent resources on 
the Internet , As you can see, they're still growing fast, and they 
increasingly include international as... 

...no convenient access to patent information. But all the caveats that 
I've issued about Internet patent freebies still apply. If you need 
indepth subject access across multiple databases covering multiple 
countries' patents , you still need the commercial indexed patent 
databases. Sometimes you get what you pay for. 



Set Items Description 

51 8377481 SEARCH? OR RESEARCH? OR RETRIEV? OR INQUIR? OR QUERY? OR Q- 

UERIES OR EXAMIN? OR INSPECT? 

52 3144275 REQUEST? OR DATA {) MINE? OR DATA () MINING? OR DATAMINE? OR D- 

ATAMINING? OR FIND? OR DISCOVER? 

53 582521 INTERROGAT? OR WEBCRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 

ETAO CRAWL? OR SEEK? OR SORT? OR HUNT? 

54 897634 INTELLECTUAL {) PRO PERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 

RK? 

55 937 TRADE 0 DRESS? OR DESIGN {) RIGHT? OR PROPRIETARY () INFORMATIO- 

N? 

56 0 LEGALLY 0 PROTECTABLE 0 KNOWLEDGE OR TANGIBLE () RESEARCH () PRO- 

PERTY 

57 9595 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

58 979283 DATABASE. OR DATABANK OR DATA{) (BASE? OR BANK? OR FILE? OR - 

REPOSITOR? OR WAREHOUSE?) OR DB OR RDB OR OODB OR ODBC OR DBMS 

59 4246060 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 

WAN? ? OR ONLINE 

510 162661 ETHERNET? OR EXTRANET? OR WWW OR WORLD () WIDE () WEB OR WORLD- 

WIDEWEB OR SUBNET? 

511 314756 WEBSITE? OR WEB () SITE? OR WEBPAGE? OR WEB {) PAGE? OR WEB ( ) A- 

DDRESS? OR URL?? OR URI?? 

512 2198390 HOMEPAGE? OR HOME () PAGE? OR FRONTPAGE? OR FRONT () PAGE? OR - 

SITE? OR HTML {) FILE? 

513 9460743 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 

EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

514 3145237 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 

RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

515 8982077 PULL {) DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 

ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

516 5054689 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 

ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

517 8874634 MODIF? OR REVIS??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 

HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

518 9681972 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR LEADO- 

FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

519 8017483 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? OR DUPLICAT? OR 

ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

520 6018468 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPL? OR TERTIAR? 

OR THIRD OR 3RD 

521 6792 S1:S3 AND S4 : S6 ( ION) S8 : S12 

522 5045 S21 AND (S1:S3 OR S7) AND S4 : S6 ( 5N) S8 : S12 

523 4097 S22 AND S1:S3 AND S4 : S6 (3N) S8 : S12 

524 150 S23 AND (S13:S15 AND S16 : S17 ) ( ION) S4 : S6 

525 25 S24 AND (S13:S15 AND S16 : S17 ) ( 5N) (S4 : S6 AND S11:S12) 

526 28 S23 AND S16 : S17 ( ION) (S4 : S6 ( ION) Sll : S12 ) 

527 3 S23 AND S13 : S15 ( ION) (S16 : S17 ( ION) (S4 : S6 ( ION) Sll : S12 ) ) 

528 31 S23 AND S1:S3 AND (S13:S15 AND S16:S17) (ION) (S4:S6 OR S7) (- 

10N)S11:S12 

529 3 S23 AND S1:S3 AND (S13:S15 AND S16:S17) (ION) (S18 : S20 ( ION) S- 

4 :S6) (ION) S11:S12 

530 171 S24:S29 

531 18075233 PY>2000 

532 117 S30 NOT S31 

533 107 RD (unique items) 

534 1718 S21 AND S1:S3(5N)S4 

535 91 S34 AND (S13:S14 AND S16 : S18 ) ( ION) S4 

536 56 S35 AND S9:S12 

537 109 S34 AND S16:S17 AND S18:S20 AND S4 ( ION) S9 : S12 

538 45 S37 AND S13 :S15 

539 174 S35:S38 



540 97 S39 NOT S31 

541 74 S40 NOT S33 

542 66 RD (unique items) 
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Author Affiliation: IBM Almaden Res. Center, San Jose, CA, USA 
Journal: SIGMOD Record Conference Title: SIGMOD Rec. (USA) vol.27, 
no. 2 p. 307-18 

Publisher: ACM, ^ — =^ 

Publication Date: June/i998 Country of Publication: USA 

CODEN: SRECD8 ISSN: 0^63^-5^08 

SICI : 0163-5808 ( 1998 06 ) 2 7 : 2L . 3 07 : EHCU; 1-0 

Material Identity Number: A660-98003 

Conference Title: 1998 ACM SIGMOD International Conference on Management 
of Data 

Conference Date: 1-4 June 1998 Conference Location: Seattle, WA, USA 
Language: English 
Subfile: C 

Copyright 1998, lEE ^ O J> 

^,-rAbs't!ractl cha 1 1 enge Tn~~irrd'ex-i^ unstructured hypertext databases is to 
atitomatically extract meta-dafea'^hat enables structured searching using 
t|ppic__t.axonGmies-; — ed-Trci[Imv?n^^ keyword ambiguity and improves the quality of 
searching and prof ile -based routing and filtering. Therefore, an accurate 
classifier is an essential component of... 

... documents having known topics. We experimented with pre-classif ied 
samples from Yahoo! and the US Patent Database . We have developed a 
text classifier that misclassif ies only 13% of the documents in the Reuters 
benchmark; this is comparable to the best results ever obtained . Our new 
classifier misclassif ied 36% of the patents , indicating that classifying 
hypertext can be more difficult than classifying text. Naively using terms 
in. . . 

...Descriptors: information retrieval ; 
...Identifiers: structured searching ; ... 

, . . search quality. . . 
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Conference Title: Proceedings of the 1995 International Chemical 
Information Conference p. 1-24 
Editor(s): Collier, H. 
Publisher: Infonortics, Calne, UK 

Publication Date: 1995 Country of Publication: UK 159 pp. 
Material Identity Number: XX96-00311 

Conference Title: Proceedings of the 1995 International Chemical 
Information Conference 

Conference Date: 23-25 Oct. 1995 Conference Location: Nimes, France 
Language: English 
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Title: Revolution or evolution? The impact of the Internet , end user 
interfaces and new software on chemical and patent information 

Abstract: Looks at the Internet and what it offers searchers of 
chemistry and patent literature. I comment on some of the developments I 
see happening, or would like to see happen. We are seeing a lot of exciting 

changes taking place, and a lot of research is going on into new 

methods of searching , retrieving , storing and displaying chemical and 

patent information. Old ways are being broken or amended, and many 
newcomers are experiencing the joys of electronic information for the 

first time. Is it revolution or evolution? I would call it evolution, but 
evolution at a. . . 

...Descriptors: information retrieval systems... 

. . . Internet ; ... 

. . . online front -ends 

Identifiers: Internet ; ... 

. . . online searching ; ... 

. . .information retrieval methods. . . 
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Publication Date: Nov. 1991 Country of Publication: USA 
CODEN: WPAID2 ISSN: 0172-2190 
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Abstract: The 1970s and 1980s saw the rise and rapid development of 
online textual databases, particularly those covering science and 
technology. Online patents files are amongst the most used of these 

databases, and are now accessed worldwide by both patent specialists and 
researchers . However, most online databases covering intellectual 
property are based on an original document or publication which 
includes highly meaningful drawings, chemical structures or diagrams. The 
rapid advances in image handling and distribution technology, such as 
document image processing (DIP), image extraction , CD-ROM, WORM discs, 
erasable optical discs, high speed laser printing, and the delivery of... 
...Descriptors: information retrieval systems... 
...Identifiers: image extraction ; 
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Publication Date: May 1989 Country of Publication: USA 
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Title: DIALOGLINK and TRADEMARKSCAN -FEDERAL: pioneers in online images 
Abstract: DIALOG and Thomson & Thomson teamed up to produce one of the 
first commercial online databases containing both text and images. 
Thomson & Thomson, a trademark and copyright research firm, added 

trademark designs to its text file, TRADEMARKSCAN- FEDERAL . 

Simultaneously, DIALOG developed DIALOGLINK, version 1.20, a powerful and 
easy-to-use software to receive and display images online from 
mainframe computers . The author evaluates TRADEMARKSCAN image retrieval 

using DIALOGLINK 1.2 0 communications software. She discusses 

TRADEMARKSCAN 's composition, retrieval methods, file size, downloading 
and printing time and costs, and database applications. She also discusses 
software requirements and functions that are relevant to image retrieval . 
Suggestions for improvements to TRADEMARKSCAN and DIALOGLINK are also 
given . 

...Descriptors: information retrieval systems... 
...Identifiers: online images... 

. . .commercial online databases. . . 

, . . copyright research firm. . . 
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Source: Proceedings of the Joint Conference on Information Sciences v 5 n 
2 2000. 

Publication Year: 2000 
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Language: English 

Title: Synergetic Neural Network Approach for Content -Based Retrieval 
of Trademarks 

Abstract: A application of synergetic neural network (SNN) for content 
based retrieval was developed that are robust against noise, partial 
occlusions and is capable of producing fast response to input queries . 
The SNN is a top-down self -organizing system, which incorporates many of 
the basic concepts of synergetics. The system enables to support affine 
invariant of input queries which are a partial version of the stored 
patterns. It was observed that the number of visual keywords do not 
increase even when new trademark images were added to the dateQjase . 
{Edited abstract) 11 Ref s . 
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Strategy for information on patents in the era of networks 
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ARTICLE TYPE: Original paper 
MEDIA TYPE: Printed Publication 

Strategy for information on patents in the era of networks 
ABSTRACT: Taking a general view of an environmental change in a 

distribution of information on patents that simplified access to such 
information is provided because more datacomes from sources and 
programs of software for retrieval are improved, it indeed seems to 
us thattoday is brought in the era of which anyone can obtain 
desirable information on patents . On theother hand, taking into 
consideration of a peculiar feature of the information on patents 
that errorfor retrieval is not permitted, the environmental change 
further provides difficulty of the access thatis caused by a lot of 
obtainable information, and a variety of media sources. From the 
pointsdiscussed above, it will be understood. . . 

...end users. That is, the information necessary for users is notfound by 
users, but a third party selects and customizes the information to be 
delivered to theusers. In this case, if... 

...to external sourceshas to be carried out, while a serious view of core 

competence is taken by companies themselves, (author abst.) 
...DESCRIPTORS: patent search ; ... 

. . .information retrieval ; ... 

. . . internet 

.. .BROADER DESCRIPTORS: retrieval ; ... 

. . .computer network ; ... 

. . .communication network ; ... 
...information network ; ... 
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TITLE: Intellectual assets --a price on (what's in) your head 

AUTHOR: Bolita, Dan 

SOURCE: KM World, v8 n2 p24(2) Fe 
ISSN: 1060-894X 
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...including intellectual assets, whose value over the last seven years has 
increased seven- fold. The first step in putting a numeric value on an 
organization's intellectual property is the process of making the asset 
as tangible as possible. Ideas, processes, concepts/business intelligence, 
CAD drawings, database entries, procedure manuals, and paten t--g ran bp^ 
converted to archived docu ments^ so organizations can better measure the 
va±ue'"ot intellectual assets an'S include them as tangible assets. IPAM is a 

patent management and product data tool with a collection of analysis 
tools and databases for organization and analysis of intellectual assets. 
SmartPatent Workbench is a desktop client package that allows customers to 
see, print, search , organize, and analyze patent documents. Dow 
Chemical, a user of IPAM, says quantification of intellectual assets can 
lead to. . . 





DESCRIPTORS: Engineering Documentation; Patents ; Record Management; 
Research & Development 



Set 
SI 

S2 

S3 

S4 

S5 

S6 

S7 

S8 

89 
SIO 

Sll 

S12 

813 
814 

815 

816 

817 

818 

819 

820 

821 



OR PROPRIETARY ( ) INFORMATIO- 
OR TANGIBLE ( ) RESEARCH ( ) PRO - 



Items Description 
1701624 INTELLECTUAL 0 PROPERT? OR PATENT? OR COPYRIGHT? OR TRADEMA- 
RK? 

998 TRADE 0 DRESS? OR DESIGN () RIGHT? 
N? 

0* LEGALLY ( ) PROTECTABLE ( ) KNOWLEDGE 
PERTY 

219476 DATABASE OR DATABANK OR DATA ( ) (BASE? OR BANK? OR FILE? OR - 
REP08IT0R? OR WAREHOUSE?) OR DB .OR RDB OR OODB OR ODBC OR DBMS 

610309 NETWORK? OR NET? ? OR INTERNET? OR INTRANET? OR LAN? ? OR - 
WAN? ? OR ONLINE 

• 65259 ETHERNET? OR EXTRANET? OR WWW OR WORLD {) WIDE () WEB OR WORLD- 
WIDEWEB OR SUBNET? 
91895 WEBSITE? OR WEB () SITE? OR WEBPAGE? OR WEB () PAGE? OR WEB()A- 
DDRE88? OR URL?? OR URI?? 
318662 HOMEPAGE? OR HOME () PAGE? OR FRONTPAGE? OR FRONT () PAGE? OR - 
SITE? OR HTML 0 FILE? 
31022 81 :83 (ION) 84 :S8 

28367 SEARCH? OR RESEARCH? OR RETRIEV? OR INQUIR? OR QUERY? OR Q- 
UERIES OR EXAMIN? OR INSPECT? 

26058 REQUEST? OR DATA () MINE? OR DATA () MINING? OR DATAMINE? OR D- 
ATAMINING? OR FIND? OR DISCOVER? 

12078 INTERROGAT? OR WEBCRAWL? OR WEB () CRAWL? OR METACRAWL? OR M- 
ETAO CRAWL? OR SEEK? OR SORT? OR HUNT? 
2 317 JAPIO OR JPO OR EPO OR USPTO OR WIPO 

28865 STORE OR STORING OR MEMORY OR ACCUMULAT? OR RECEIV? OR ACC- 
EPT? OR ACQUIR? OR OBTAIN? OR CULL? OR CACHE? 

19756 STOCK? OR COLLECT? OR GATHER? OR GLEAN? OR AMASS??? OR ACC- 
RU? OR AGGREGAT? OR COMPIL? OR SIFT? OR CACHING 

28622 PULL 0 DOWN? OR TAKE? OR STORAGE? OR TAKING? OR DERIV? OR P- 
ROCUR??? OR GET? OR TAP? ? OR CAPACIT? 

213 3 0 CONVERT OR CONVERTS OR CONVERTING OR CONVERSION? OR TRANSF- 
ORM? OR ALTER??? OR REFORMAT? OR EXTRACT? 

29339 MODIF? OR REVI8??? OR TRANSLAT? OR REMODEL? OR ADAPT? OR C- 
HANGE OR CHANGE? ? OR CHANGING? OR COLLAT? 

29833 FIRST? OR 1ST OR PRIMARY OR INITIAL? OR ORIGINAL? OR 
FF? OR MAIN OR CHIEF OR INTRODUCTORY? 

2 97 67 SECOND? OR 2ND OR DOUBL? OR TWIN? OR EXTRA? 
ANOTHER OR SUBSIDIAR? OR AUXILIAR? 

24715 THREE? OR TRIO? OR TRIUNE? OR TRIAD? OR TRIPL: 
OR THIRD OR 3RD 



822 


5875 


IC=i 


G06F' 


p 


823 


2335 


(89 


OR 813) (5N) SIO :S12 


824 


1920 


823 


AND 


(814:816 AND 817:S18) (ION) (81:88) 


825 


1819 


824 


AND 


(817:818 AND 819:821) (10N)S1:S8 ' 


826 


1582 


825 


AND 


S17:S18(10N) (81:S3 OR 814:816) 


827 


483 


826 


AND 


822 


828 


62 


823 


:S27 


AND S1:S3/TI 


S29 


1582 


825 


AND 


826 


830 


1920 


824 


:827 


OR 829 


831 


1814 


824 


AND 


(S14:S16 AND S17:818) (5N)S1:88 


832 


1280 


831 


AND 


81:83 (5N) 84:88 


833 


405 


832 


AND 


822 


834 


1280 


832 


:S33 




S35 


365 


834 


AND 


(814:816 AND S17:S18) ( 5N) S 1 : 84 ( 5N) 85 : 88 


836 


243 


835 


AND 


(814:816 AND 817:818) (5N) SI : S3 (5N) S5 : 88 


837 


745 


827 


:S28 


OR 833 OR S35 :S36 


838 


569 


S37 


AND 


822 


839 


13487 


AD=: 


2001 


:2005 


840 


258 


S38 


NOT 


839 


841 


258 


IDPAT (sorted in duplicate/non-duplicate order) 




File 348: EUROPEAN PATENTS 1978 -2005/Dec W02 

(c) 2005 European Patent Office 
File 349:PCT FULLTEXT 1979-2005/UB=20051208 , UT=20051201 

(c) 2005 WIPO/Univentio 



41/3, K/52 (Item 52 from file: 348) 

DIALOG (R) File 34 8: EUROPEAN PATENTS 
(c) 2005 European Patent Office. All rts . reserv. 



01068801 

Multilingual patent information search system 
Suchsystem fur mehrsprachige Patentinf ormation 
Systeme de recherche d ' information brevet multilingue 

PATENT ASSIGNEE: 

MAZDA MOTOR CORPORATION, (547923), 3-1, Shinchi, Fuchu-cho, Aki-gun, 
Hiroshima-ken, (JP) , (Applicant designated States: all) 
INVENTOR : 

Nosohara, Makifumi, c/o ITI inc., 6-25, Hikari-machi 2-chome, 
Higashi-ku, Hiroshima-shi , Hiroshima-ken, (JP) 
LEGAL REPRESENTATIVE: 

Skuhra, Udo, Dipl.-Ing. (11161), Reinhard-Skuhra-Weise & Partner 
Patentanwalte Postfach 44 01 51, 80750 Munchen, (DE) 
PATENT (CC, No, Kind, Date) : EP 940762 A2 990908 (Basic) 

EP 940762 A3 040102 
APPLICATION (CC, No, Date): EP 99102878 990303; 
PRIORITY (CC, No, Date) : JP 9850659 980303 
DESIGNATED STATES: DE 

EXTENDED DESIGNATED STATES: AL; LT; LV; MK; RO; SI 
INTERNATIONAL PATENT CLASS: G06F-017/30 ; G06F-017/28 
ABSTRACT WORD COUNT: 8 9 
NOTE: 

Figure number on first page: 1 

LANGUAGE (Publication , Procedural , Application) : English; English; English 
FULLTEXT AVAILABILITY: 

Available Text Language Update Word Count 

CLAIMS A (English) 9936 1824 

SPEC A (English) 9936 9312 

Total word count - document A 11136 
Total word count - document B 0 

Total word count - documents A + B 11136 



INTERNATIONAL PATENT CLASS: G06F-017/30 ... 
. . . G06F-017/28 



. . .SPECIFICATION using the replaced second search expression, and 

when no result is obtained from the new database , the first 
transmission means transmits the replaced second search expression to 
the patent information search apparatus . 

The above object can also be achieved by a patent information search 
apparatus according to claim 28, for searching a patent information 
database on the basis of an input search expression and outputting a 
search result, comprising: 

replacement means for replacing an input first search expression with 
a second search expression for searching the patent information 
database , on the basis of bibliographic information search contents of 
the search expression. 
The above object. . . 

. . .of claims 29 to 31 . 

In the above search apparatus or method, the patent information 



database can be searched on the basis of information other than 
bibliographic information. 

Other features and advantages of the... 

.search apparatus 103; 

Fig. 7 is a flow chart for explaining translation processing of the 
patent information search apparatus 103; 

Fig. 8 is a view showing an example of a menu. . . 

.synthesis result of an English abstract and drawings; 

Fig. 15 is a block diagram showing another arrangement of a patent 
information search system according to the present invention; 

Fig. 16A is a view showing the a patent information search system 
and apparatus, and a control method therefor of the present invention 
will... 

.to an embodiment of the present invention. A client terminal 101 is 
connected to a patent information search apparatus 103 through a 
network 102 (e.g., the Internet) . This client terminal 101 is a terminal 
emulator which receives a command from the information search apparatus 
103 and displays it. The command has a. . . 

.filed in, e.g., Japan, and searches various databases (115, 116, and 
117) storing Japanese patent information and the like in response to a 
request from the client terminal 101. A... 

.charge for fee claiming to the user of the patent information search 
apparatus 103 . 

A translation section 111 requests to translate information 
obtained upon searching the various databases (115, 116, and 117) into a 
predetermined language. A storage... 

.the apparatus to a database server 113. 

This patent information search apparatus 103 incorporates a 
translation engine or tool (not shown) . The translation section 111 
issues the request to this translation. . . 

.communication line together with a text to be translated and receives 
the translation result. 

The database server 113 comprises a CPU 118 for controlling the 
entire database server 113, a ROM 120 storing control programs and the 
like for controlling the database server 113, a RAM 119 used as, e.g., 
a work area in executing a. . . 

.the like stored in the ROM 12 0, and the various databases (115, 116, and 
117) storing Japanese patent information and the like. 

The English abstract database 115 stores English abstracts 
corresponding to laid-open patent official gazettes as one of patent 
information. The full text search database 116 stores laid-open patent 

official gazettes (Japanese) and the like associated with patents 
filed in Japan. 

The database 117 stores various tables such as a synonym table 1201, 
a word replacement table 1202. . . 

.conversion table 13 02, a free keyword table 1601, and an IPC replacement 
table 1602. 

The database server 113 shown in Fig. 1 is independent of the patent 



information search apparatus 103. However, the present invention is not 
limited to this. The arrangement and function of the database server 
113 may be integrated with the patent information search apparatus 103. 

According to the patent information search system of the present 
invention, a person. . . 

.than Japanese can easily search for information related to patents filed 
in Japan from the database server 113 using a terminal device set 
abroad or a foreign- language terminal device in Search " is used to 
search for patent information stored in the datcd>ase (115, 116, or 117) 
on the basis of an arbitrary English word input by the user, and has a 
menu layout as shown in Fig. 9A. The "Expert Search " is used to search 
for patent information stored in the database (115, 116, or 117) after 
the search range is narrowed down to, e.g., search... 

.to laid-open patent official gazettes as one of patent information. The 
full text search DB 116 stores laid-open patent official gazettes 
(Japanese) and the like associated with patents filed in Japan. 

The English abstract DB 115 and full text search DB 116 are, e.g., 
relational databases and have table layouts as shown in Figs... 
.example, not only databases associated with patents but also databases 
of utility models, designs, and trademarks can also be handled. The 
database language is not limited to Japanese. 

Referring back to Fig. 2A to explain the control,.. 

.returns the search result to the terminal 101. 

The search result is received from the patent information search 
apparatus 103 (step S2 07) . As shown in Fig. lOA, the received search 
result . . . 

.searching the full text search DB 116 is selected in step S204, a 
laid-open patent official gazette corresponding to the clicked 
publication number 1002 can be obtained in a display. . . 

.To switch the English abstract 1013 displayed on the monitor to a 
Japanese laid-open patent official gazette, the "CHANGE" button 1010 is 
clicked to display the laid-open patent official gazette 1014 
corresponding to the displayed laid-open patent official gazette 1014 on 
the monitor. 

The pieces of information obtained by searching the patent 
information search apparatus 103 include not only English information as 
shown in Fig. lOB but. . . 

.translation engine incorporated in the patent information search 
apparatus 103) to translate the laid-open patent official gazette 
described in Japanese into a language that can be understood by the user 
...search apparatus 103 to estimate the translation fee or order 
translation . 

In addition, in the patent information search system of the present 
invention, when a foreign country has the priority right... 
TO-ZAI-NAN-BOKU KOGYO in Japanese. However, search of patent databases 
based on this translated word is meaningless. In this embodiment, a 
search word is converted into another search word. . . 

.the option for searching the full text search DB 116 is selected, a 
laid-open patent official gazette corresponding to the clicked 
publication number 1002 is transmitted to the client terminal... 



..table 1102 is acquired from the full text search DB 116. 

By looking up the conversion table 13 02 shovm in Fig. 13B, Japanese 
character strings representing the non-textual contents such ... recognized 
character strings are replaced with corresponding English character 
strings by looking up a predetermined conversion table. 

Next, the acquired English abstract 1401 and image data 1405 in which 
Japanese character strings representing the non. . . 
.information search apparatus 103 calculates an estimate of the 
translation fee for a laid-open patent official gazette requested by 
the user in step S215 (Fig. 2B) {step S701) . The calculated fee estimate 
is transmitted to the client terminal 101 (step S702)-. When a formal 
translation request is received from the user who had confirmed the 
fee estimate (step S704), the patent information search apparatus 103 
requests a translation agency or the like to translate the laid-open 
patent official gazette designated by the user using, e.g., an e-mail. 

That is, not the patent information search apparatus 103 but a 
translation agency well versed in various technical fields is charged 
with translation. With this arrangement, the load on the patent 
information search apparatus 103 can be reduced, and in addition, the 
translation accuracy increases as . . . 

.patent information search apparatus 103, as described above. 

(Modification of First Embodiment) 

The above-described patent information search system of the first 
embodiment can be modified to an arrangement shown in Fig. 15. The same 
reference numerals as in Fig. 1... 

.the relay server, so the search apparatus 103 or database server 113 
need not be changed . Hence, the development cost for constructing the 
system of the present invention is minimized. In... 

.thereof, when a user who cannot understand Japanese is to use a database 
storing Japanese patent information and the like, he/she can obtain 
desired information without being conscious of Japanese. 

( Second Embodiment) 

In the second embodiment, an example in which the patent information 
search system described in the first embodiment is applied to PATOLIS 
provided by Japan Patent Information Organization will be described. 
The arrangement and some processing operations of the patent 
information search system of the second embodiment are the same as in 
the first embodiment, and a detailed description thereof will be omitted. 

The PATOLIS is a commercial database in which information associated 
with patents filed in Japan and the like can be searched for. This 
PATOLIS is constituted such... as the number of laid-open patent official 
gazettes containing a character string can be obtained . 

However, since technical terms which can be used for fee keyword search 
are determined in. . . 

.patent system between countries cannot be solved by word translation. 
More specifically, in accessing a patent database in a certain 
country using another language, although the communication protocols 
for access can be matched, the patent formats cannot be... 

.a search expression or keyword in the language of the country (Japanese) 



having the corresponding patent database is created, no search errors 
occur. 

As described above, according to the second embodiment, by applying 
the present invention to a commercial database such as the PATOLIS, 
even a user who cannot understand Japanese can easily use the commercial 
database such as the PATOLIS. 

As described above, according to the first and second embodiments, 
when a user who cannot understand Japanese wants to use a database 
storing Japanese patent information and the like, he/she can obtain 
desired information without being conscious of Japanese. The present 
invention can also be applied to various database search systems. 

(Other Modifications ) 

In the above embodiments of the present invention, a case wherein a 
user whose native language is English uses the database storing Japanese 
patent information and the like has been described. However, the present 
invention is not limited to. . . 

.a case wherein a user whose native language is French uses a database 
storing German patent information and the like. In this case, a user 
interface such as a menu window. . . 

.present invention is preferably performed in the information search 
apparatus 10 3 of the first or second embodiment from the viewpoint of 
preventing the influence to the database server 113 . However, the 
search expression replacement function may be imparted to the database 
server. . . 

.present invention can be applied to a search apparatus which is not 
connected to a network and, more specifically, a patent information 
search system in which the terminal 101, search apparatus 103, and 
database 113 are integrated. In. . .his/her personal computer or 
workstation . 

In the above embodiments, a case wherein the Japanese patent 
database (PATOLIS) is set in Japan, and a user who is not proficient in 
Japanese accesses. . . 

.only the Japanese PATOLIS system. When the present invention is to be 
applied to a patent database in a language other than Japanese (e.g., 
the U.S. LEX PAT ) , the language used for the patent database and the 
user language must be taken into consideration. For example, the 
present invention can be applied to a case wherein a user whose native 
language is not English accesses an patent database X (this database 
X is set in a country A) which uses English in the country A or. . . 

.line. 

As a further modification of the modification (Fig. 15) of the first 
embodiment, a database constructing function is imparted to the relay 
apparatus. In this case, the relay apparatus sends... 

.apparatus searches the internal database first. Only when no result is 
obtained by searching the database , the relay apparatus sends the 
search request to the database 113 . 
The network applied to. . . 

.the various embodiments as set forth discloses a computer program 
storage medium used for the patent information search system, method 



and apparatus. The medium is defined in the following items (1) to (3): 

(1) A storage medium which stores a program for controlling a patent 
information search system having a patent information search apparatus 
for searching a patent information database on the basis of an input 
search expression and a patent information processing apparatus 
connected to said patent information search apparatus through a network, 
characterized by comprising: 

a program code for replacing an input first search expression with a 
second search expression for searching the patent information database 
, on the basis of bibliographic information search contents of the search 
expression; 

a program code for transmitting the replaced second search 
expression to said patent information search apparatus; 

a program code for searching the patent information database on the 
basis of the received second search expression; and 

a program code for transmitting a search result to said patent 
information processing apparatus. 

(2) A storage medium which stores a program for patent information 
search for searching a predetermined patent information dateibase 
on the basis of an input search expression and outputting a search 
result, characterized by comprising: 

a program code for replacing an input first search expression with a 
second search expression for searching the patent information database 
, on the basis of bibliographic information search contents of the search 
expression; 

(3) A storage . . . 

..by comprising: 

a program code for replacing an input first search expression with a 
second search expression for searching the patent information database 
, on the basis of bibliographic information search contents of the search 
expression; 

a program code . . . 

. .database through the network; and 

a program code for receiving a search result by the patent 
information database and outputting the search result. 
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...ABSTRACT search and display files based on the user's operations of the 
terminals, a file database for accumulating the accessible files 
either within or outside of the information terminals, and a key object 

. . .display window in order to search for other related files, wherein the 
above key object extracting means searches for and retrieves, from the 
above file database , files containing a key object group extracted 
from the above information input and display window and a key object 
group possessing similarity. . . 

...SPECIFICATION And various independent technologies have also been 
developed to provide search utilities for resources over networks . 
However, conventional search tools and interfaces necessitates "search 



key extraction "selection of search field"/ "input using search 
protocol based on search logic", among others, as... 

.an object as operable (or active) exist as well. Due to the advancement 
of large capacity recording apparatuses and the advancement and wide 
popularization of networking , we are simply overflowing with large 
amounts of files. Consequently, the need to apply these... 

.lessening the burden from the information search operations, there is 
the invention of the Japanese Patent Application, First Publication, 
No. Hei 5-307570 "File Management Apparatus". This particular claimed 
invention has the characteristic... 

.and the usage thereof, and to perform the search-specific operations 
every time. 

The Japanese Patent Application, First Publication, No. Hei 5-81328 
"Automatic Key Word Input System" is claimed to reduce the. . . 
.of the claimed invention misleadingly implies. 

The claimed invention relating to the automatic key word extraction 
represented by the Japanese Patent Application, First Publication, 
No. Hei 7-230468 "Automatic Key Word Extraction and Method Therefor" 
proposes an improved method of extracting the most appropriate key word 
from. . . 

.automatic searches including this invention. 

As a method of automatically generating a search format for retrieving 
files stored in a file database , there is the Japanese Patent 
Application, First Publication, No. Hei 5-189492 "Apparatus for 
Automatic Generation of Search Format". However, this apparatus... 

.search and display files based on the user's operations of the 
terminals ; 

a file database for accumulating the accessible files; and 
a key object extracting means for extracting key objects from a 
portion of or from the entire information displayed in the above... 

.display window in order to search for other related files, 

wherein the above key object extracting means searches for and 
retrieves, from the above file database , files containing a key object 
group extracted from the above information input and display window and 
a key object group possessing similarity. . . 

.search for other related files; 

a step for searching for and retrieving, from a file database for 
accumulating the accessible files, files containing a key object group 
extracted from the above information input and display window and a key 
object group possessing similarity ... representing the picture element 
distribution if the key objects are bitmaps. 

When the key object extracting means 10 retrieves a file from the 
file database 20, the determination of the degree of similarity of the 
key objects sent to the. . . 

.procedure illustrated in Fig. 3. With respect to a single file retrieved 
from the file database 20, the key object extracting means 10 first 
performs the determination of the category of the inputted object at 
process S2-1. The... user may refer to the file access histories of other 



users who share the file database or even freely take advantage of 
the accessed information. Moreover, by simultaneously displaying 
multistaged search results such as a. . . 

.CLAIMS search and display files based on said user's operations of the 
terminals ; 

a file database for accumulating accessible files; and 
a key object extracting means for extracting key objects from at 
least a portion of the information displayed in said information 
input and display window in order to search for related files, 

wherein said key object extracting means searches for and 
retrieves, from said file database , files containing said key 
object group extracted from said information input and display 
window and a key object group possessing similarity greater... 

.search for other related files; 

a step for searching for and retrieving, from a file dateibase for 
accTimulating the accessible files, files containing a key object 
group extracted from said information input and display window and 
a key object group possessing similarity greater. . . 
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Detailed Description 

Claims 

English Abstract 

. . .Through retrieving compliant HTML pages, a search engine forwards 
retrieved pages to an object oriented database which sorts received 
information by the information's internal organization structure. By 
searching the information as stored in. . . 

Detailed Description 

A SYSTEM, METHOD, AND MEDIUM FOR RETRIEVING , ORGANISING, AND UTILIZING 
NETWORKED DATA The following description includes some copyrighted 
material. While Applicants do not object to the copying of this 
specification for patent related. . . 

. . .present invention. 

BACKGROUND OF THE INVENTION 
I . Technical Overview 

The present invention relates to a networked , information gathering 
and delivery system. More specifically, the present invention relates to 
a network based ( intranet-based ... categorized information is hereinafter 
referred to as a "portal" as the portal acts as a primary interface to 
the organized information residing on the network of computers. When a 
portal is located on a user's local site, the user... 



. .Also, for simplicity, the network of computers is referred to as the Web 
or the Internet . However, it is understood that any networked group 



of computers storing 5 organized information is included within the 
scope of the invention. 

The various users of ... resources available to the players involved. 
Portal 201 includes a Web site server 202, an online page generator 204 
(also known as a protocol translator ) , a searching system 205 (referred 
to herein as a Web 
9 

Crawler), search engines 2 03 ... qualifiers , the system attempts to find 
the desired rectifier with the most important values satisfied first . 

The web site server 202 converts the received results from search 
engine 2 03 into HTML and forwards or "serves" the created page to... 

.lacks the capability or desire to support organized pages locally) 
retrieves pages 3 07 into an Internet browser (software running on user 
304 's computer which translates HTML pages and displays the pages on 
user 304 's display) . 

Pages 307 include at ontologies. These ontologies are translated into 
an HTNE form and sent to user 304 's Internet Browser 305 as template 
pages 307. User 304 next populates the received template pages 307... 

.populated template pages 3 06, then user 304 forwards the populated 
template pages 306 as formatted web pages 308 to ISP 303 for storage 
. Next, web crawler 3 10 scans ISP for complying pages 3 08 and retrieves 
them as... 105 which has the capability to publish its own 
protocolcompliant pages. The supplier 105 's site includes a desktop 
framework 401, an Internet browser 305, protocol translator 402, web 
server 403, legacy applications 405, and legacy databases 406. An 
operator at desktop... 

.resulting page 404 may be transferred to web server 403 for posting. On 
request, the compiled pages 404 may be transferred to other sites 

(for example, users' portals 102) as pages 410. Alternately, the protocol 
translator may map data... may act like server 403 in Figure 4. To this 
end, legacy applications 507, protocol translator 505, legacy database 

506 may function similarly to that of the corresponding elements in 
Figure 4 . 

Tool suite ... tracking component 608 are a reverse index database (RIDB) 
61 1 and an object oriented database (OODB) 6 10. Crawler 609 may 
receive pages from a number of sources including a supplier's web 
site 601, a sector portal web site 616 (publishing pages and/or 
ontologies 634), or a private project web site 606. Crawler 609 
parses the received pages into character strings and stores the strings 
in RIDB 611 where RIDB 611 indicates... 

.ontology tracking component may store the found protocol data in OODB 6 
10. In storing the found protocol data in OODB 6 10, ontology 
tracking component 608 may control the OODB 610 to store the... or 
elemental in nature as that it is combined with other information stored 
in the OODB (other atoms I 0 or elements, collectively "items") and 
provided to a user. The user requests the items from portal 607 and 
receives data 625. Project knowledge base manager 604 receives data 625 
and stores the data in database 605. Here, the received data 625 may 
be cataloged according to current projects a user is working. Also, the 
database 605 may also contain catalogs of items. For example, a user may 
want to build a device that requires a first component and a second 



component. The user searches via a search query (Boolean, hierarchical, 
parametric . . . 

.base manager may store the information relating to items A and B 
together in a first catalog in database 605 and items C and D 
together in a second catalog in database 605. Accordingly, when the 
user needs to evaluate or try a different item (part, process ... use in 
the creation of end items; while at least one goal of the project 
database is to store the created end items (products or processes) as 
they evolve . 

Once a user completes an. . . 

.616) and for information (627) from internal information providers 
(606) . 

2) the portal stores the received information; 

3) the portal dynamically creates web pages for a user based on the 
stored 

information; and, 

4) provides links to the additional... 

.tools may autonomously access the information stored in the portal 607 
or stored in supplier web sites 601 to compile and assist users in 
defining end items. 



aim 

I A system for organizing information comprising: 

a data source storing data including organizational terms; 

a network connected to said data source; 

a portal connected to said network for retrieving the data... 
.2 1 

. The system according to claim 8, wherein said organizational manager 
dynamically organizes said data based an ontology developed from the 
received organizational terms. IL The system according to claim 1, 
wherein the organizational terms include at... 

.descriptor includes a method identifier. 

15 A system for organizing information comprising: 

a data source storing data including organizational terms; 

a network connected to said data source; 

a portal connected to said network for retrieving the data. . . 

.video display. 
22 

. The system according to claim 16, wherein said portal further 
comprises : an extraction device for extracting the categorized data 
from said storage device. 

21 The system according to claim 15, wherein said portal further 
comprises: an extraction device for extracting the categorized data 
from a storage device, wherein said extraction device extracts the 
categorized data based on Boolean search criteria. 

22 The system according to claim 15, wherein said portal further 



comprises: an extraction device for extracting the categorized data 
from a storage device, wherein said extraction device extracts the 
categorized data based on hierarchical search criteria. 

23 The system according to claim 15, wherein said portal further 
comprises: an extraction device for extracting the categorized data 
from a storage device, wherein said extraction device extracts the 
categorized data based on parametric search criteria. 5 24. The 
system according to claim 20, wherein said storage... 



.of class, method, and attribute criteria. 



25 The system according to claim 20, wherein said storage device 
receives said data from a reverse index database . 



26 The system according to claim 20, further comprising: 

an organizational manager for organizing said data based on a 
previously stored ontology of the organizational terms for storage in 
said storage device. 

27 The system according to claim 20, further comprising: 

an organizational manager for dynamically organizing said data based 

an ontology developed from the received organizational terms. 

23 

. The system according to claim 15, wherein the organizational terms 
include at . , . 



.according to claim 32, wherein said categorizing step further includes 
the step of 

organizing the received data based on the organizational terms. 
36 The method according to claim 35, wherein said organizing step... 



. terms . 



37 The method according to claim 35, wherein said organizing step 
dynamically organizes said data based an ontology developed from the 
received organizational terms. 

38 The method according to claim 33, wherein the organizational terms 
include at . . . a storage device. 

25 

. The method according to claim 43 , further comprising the step of 
extracting the categorized data from said storage device. 

45 The method according to claim 43, further comprising the step of 

extracting the categorized data from said storage device based on 
Boolean search criteria. 



46 The method according to claim 43, further comprising the step of 

extracting the categorized data from said storage device based on 
hierarchical search criteria. 



47 The method according to claim 43, further comprising the step of 

extracting the categorized data from said storage device based on. 
parametric search criteria. 



48 The method according to claim 43, wherein said. 



...of class, method, and attribute criteria. 

49 The method according to claim 44, wherein said storing step 
receives said data from a reverse index database . 

50 The method according to claim 44, further comprising the step of 
organizing said data based on a previously stored ontology of the 
organizational terms for storage in said storage device. 

51 The method according to claim 44, further comprising the step of 
dynamically organizing said data based an ontology developed from the 

received organizational terms. 

52 The method according to claim 42, wherein the organizational terms 
include at . . . 
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Detailed Description 

case, CD-ROMs have been used to export 
databases to multiple users so that information storage and 

retrieval takes place at the user site , 
In the U,S. Patent No, 5,010,571 to Ron Katznelson and 

the U,S, Patents Nos, 4,827 provide a method and apparatus for retrieving 
secure 

information from a mass storage medium at a user site which 
is not susceptible to attack or compromise by a user. 

It is a further. . . 
. . .of the present invention to 

provide a system for retrieving secure information from a 

mass storage medium at a user site which does not require a 

reconfiguration of a personal computer at the user site, 

These... on the computer screen, print out a hard copy 

and/or transmit a copy by LAN or modem to another 

In accordance with the SCSI standard, the SCSI bus 

extends up to twenty-six feet... 
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52 32 8097 REQUEST? OR DATA () MINE? OR DATA {) MINING? OR DATAMINE? OR D- 
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generate a patent extract , and registers extract into patent 
database 

. . .Abstract (Basic) : NOVELTY - A patent extract storing unit (103) 
extracts predetermined data e.g. application data, detailed summary, 
drawing data to automatically generate a patent extract . The 
patent extract is then registered into the patent database . 
DETAILED DESCRIPTION - A patent document storing unit (102) 
classifies every documented application, detailed statements, detailed 
summary, and detailed drawing of the patent document input into a 
terminal equipment, and stores the data into a patent database (104 

...ADVANTAGE - Reduces processing burden. Reduces time required for loading 
and observing search document since amount of documents that needs to 
be confirmed are reduced. DESCRIPTION OF DRAWING (S) - The figure shows 
the structural diagram of the automatic patent - extracting 
production system, (102) Patent document storing unit; (103) 
Patent extract storing unit; (104) Patent database . 

. . .Title Terms: DATABASE 
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construction method and a system thereof are provided to process 
raw data obtained in a patent information retrieval system, and to 
output various types of patent maps on a computer monitor or to. . . 

... The, method comprises steps of downloading a text file resulted 

from a search operation on a specific subject in a patent 
information retrieval system (S700 ) , generating a project data file 

constructed from the text file and storing the project data file 
at a storage device(S712, S714) , generating a memo data file with a 
level, erase or specific memo on individual patent data of the 
project data file (S715) , generating analysis data file 
including analysis methods, classifications, and charts and storing the 
analysis data file at a storage device (S718 ) , classifying the 
patent source data by the classification and analyzing the patent 
source data by the analysis method, extracting the analysis data 
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.Abstract (Basic) : The data copyright management system has a datadjase 
and a key control centre for managing copyright in the event that a 



primary user edits a copyrighted primary data. The latter is... 

...centre sends the primary use permit key to the primary user upon receipt 
of a request for distribution of the primary key from the primary- 
user who requires use of the data. The primary user decrypts the 
copyrighted primary data to plain-text using the received primary 
use key for primary utilisation of the data. . . 
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ABSTRACT 

PROBLEM TO BE SOLVED: To shorten tlie network administration time by- 
updating tlie configuration of a target device according to configuration 
data . 



SOLUTION: A hypertext transfer protocol (HTTP) server receives a URL - 
encoded request from a browser 83 and processes it. A URL in it specifies 
one CGI script . . . 

. . . execute the CGI script so as to dynamically generate a response to the 
URL-encoded request The response sent to the browser 83 visually 

presents the status and configuration of the target device. The CGI script 
specified by the URL-encoded request is executed so as to update the 
status and configuration of the target device and then the HTTP server 103 
changes entries of the status and configuration in a data base 105 
regarding the device . 
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ABSTRACT 



PROBLEM TO BE SOLVED: To improve a reproduction rate while the precision of 

retrieval is highly maintained. 
SOLUTION: A computer 1 reads patent literature (primary information) from 
a data base 3, extracts secondary information on a bibliographical 
item and an index word and stores them in an auxiliary storage device 
7. A user inputs a retrieval expression by using a keyboard 4 and a 
pointing device 5 while he views a... 

. . . the coappearance of the index word is regulated by an OR approximate 
operator in the retrieval expression, the computer 1 refers the inputted 
index word with the index word extracted from. . . 
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ABSTRACT 



. . . between different types of databases is realized while reducing the 
concentration of access frequency, since URL information is acquired by 
registering data in plural types of DBMS lOh based on the vocabulary to be 
included in Internet document collected by a retrieval robot 10a and 
simultaneously extracting the registered data from the DBMS lOh as 
corresponding type according to an inquiry character string. 
COPYRIGHT : (C) 1999, JPO 
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ABSTRACT 



PROBLEM TO BE SOLVED: To retrieve a Web page with the same mark without 
retrieving a key word nor generating a link directory when the Web page 



IS 



retrieved 



SOLUTION: A mark ID etc., is implanted in a specified mark image by 
responding a mark request from a WWW ( world wide web ) server 102, 
information on the Web page corresponding to the mark ID is registered in 



... server 103. The mark is attached to the created Web page and registered 
in a Web page DB 3 021 by the server 102. The Web page is acquired 
from the server 102, the information implanted in the mark is read and a 
request for retrieval by the mark is issued to the server 103 by a 
client terminal 101. Information... 

...to the specified mark ID is transmitted to the terminal 101 by referring 
to the DB 2021, by the server 103. 
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. . .means deriving data record in accordance with a derivation rule obtained 
in a derivation rule retrieval means. 

SOLUTION: A data read means 10 receives a data derivation command and reads 
one data record from a database 60. The data record which is read is sent 
to the derivation rule retrieval means 20. The derivation rule retrieval 
means 20 retrieves a derivation rule group 50 in accordance with the 
content of the sent data record. . . 

...record which is read by the data read means 10 are sent to the data 
derivation means 30 from the derivation rule retrieval means 20. The 

data retrieval means 3 0 generates the derived data record from the 
original data record in accordance with the description of the obtained 
derivation rule and sends the generated derivation data record to a 
data - base write means 40. 
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ABSTRACT 

...in accordance witli the secular change of the PI. Then PI to be published 
is retrieved by referring to the reference data and history data 
corresponding to the retrieved PI are referred to. Then PI having history 
changed after the preceding segmentation time is extracted while 
determining edition processing. For instance, a patent information 
edition file 40 storing , a patent data base 10 and preceding 
edition contents related to patent information is included in the storage 
means . 
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ABSTRACT 

...BE SOLVED: To accurately grasp tlie gist of an invention and to improve 
tlie Jceyword retrieval efficiency by meclianically extracting }ceyword data 
as it is from tlie range, etc., of a... 

...of Japanese syllabary), signs and Roman latters, etc., is extracted from 
tlie document information of patent application such as the range, etc., 
of tlie patent demand which is stored in the external storage means 3 
and second keyword data consisting of the character strings of 
HIRAKANA (cursive form of Japanese syllabary) is extracted . Then, third 
Iceyword data consisting of specified KANJI is obtained in a non- 
extraction state. 
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Abstract (Basic) : 

... A method for analyzing and using intellectual property 

information and a system thereof are provided to mutually exchange 
views on information on intellectual property by accessing to 
intellectual property information database on line through a period 
and a keyword fixed, and by providing the information to research 
workers and persons in charge. 
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includes information on all sorts of intellectual properties . An 
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